Bloomberg is the market leader for financial data and workflows globally, servicing hundreds of thousands of financial professionals across a wide variety of roles. The Platform Security group, in conjunction with our Information Security thought leadership and Security Operations groups, aims to keep the company's information and products secure, enables developers to secure the various technological solutions they build, provides the data and intelligence needed to ensure our systems are protected at all times, and enables workplace agility without compromising security.
The Secure App Hosting Infrastructure team is responsible for building and managing infrastructure that is used for hosting various remote access applications. These business critical applications are used by Bloomberg customers as well as employees worldwide. Our team ensures reliability and uptime of the infrastructure and services by applying modern software and system engineering practices. To operate at scale, we aim to provision our infrastructure using infrastructure-as-code patterns and using various automation techniques.
One of the flagship products that run on the infrastructure managed by our team is Bloomberg Anywhere which provides users ability to access Bloomberg Terminal from anywhere in the world. Bloomberg Anywhere handles approximately 1 million unique sessions a month. We run this workload in a hybrid environment composed of on-prem and public cloud that spans several geographic regions. Our team develops automation, monitoring and processes to make this environment scalable and achieve 99.99% uptime. The team keeps a close watch on capacity and performance to deliver premium service to our customers.
Apart from this, we are also responsible for managing solutions that empower Bloomberg employees to work from anywhere. These solutions are used by thousands of users on a daily basis and are critical to their productivity.
We are looking for an engineering leader passionate about reliability and performance along with excellent people leadership skills. We expect our leaders to create inclusive, self-managing teams, which take pragmatic decisions and make technology a means to an end. We work with a wide variety of partners inside the organization, and open collaboration is in the DNA of the company. We expect our leaders to be proactive, build effective relationships based on trust, and think holistically about the company and its clients at large.
We'll trust you to:
• Help establish and apply SRE best practices to our solutions
• Provide technical leadership to engineer solutions to monitor the health, availability, and capacity of our environment and software using industry standard tools and practices
• Assist in architecting large-scale secure solutions for our teams products
• Define, measure, and achieve service level objectives as appropriate for the systems the team manages
• Hire, retain and nurture diverse talent
• Share on call responsibilities with the team of 3-5 engineers
You'll need to have:
• 4+ years of experience in a relevant infrastructure engineering or SRE role
• 2+ years of experience leading software developers and/or SREs
• A degree in Computer Science, Engineering or similar field of study or equivalent work experience.
• Ability to build reliable infrastructure solutions used by a wide range of applications
• Experience with using configuration and orchestration software
• Understanding of Linux or Windows systems including networking
• Experience with cloud services like AWS, Azure and related tools such as Terraform, Packer etc
• Familiarity with any programming language for development and testing
• Ability to create a team culture where individuals feel safe expressing themselves and trying new things
• Ability to manage multiple stakeholders and build long standing relationships.
• Ability to network across a wide variety of internal teams regardless of relation to main responsibility
We would love to see:
• Prior experience or knowledge of remote access technologies
• Experience working in the security domain
• Experience troubleshooting scalable distributed systems
• Experience with metrics and monitoring(e.g. grafana, splunk, humio, prometheus